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ABSTRACT 

Human DNA Pol k is a polymerase enzyme, speciali- 
zed for near error-free bypass of certain bulky 
chemical lesions to DNA that are derived from envir- 
onmental carcinogens present in tobacco smoke, 
automobile exhaust and cooked food. By employing 
ab initio QM/MM-MD (Quantum Mechanics/ 
Molecular Mechanics-Molecular Dynamics) simula- 
tions with umbrella sampling, we have determined 
the entire free energy profile of the nucleotidyl 
transfer reaction catalyzed by Pol k and provided 
detailed mechanistic insights. Our results show that 
a variant of the Water Mediated and Substrate 
Assisted (WMSA) mechanism that we previously 
deduced for Dpo4 and T7 DNA polymerases is 
preferred for Pol k as well, suggesting its broad ap- 
plicability. The hydrogen on the 3 -OH primer terminus 
is transferred through crystal and solvent waters to 
the y-phosphate of the dNTP, followed by the asso- 
ciative nucleotidyl transfer reaction; this is facilitated 
by a proton transfer from the y-phosphate to the 
ot,p-bridging oxygen as pyrophosphate leaves, to neu- 
tralize the evolving negative charge. MD simulations 
show that the near error-free incorporation of dCTP 
opposite the major benzo[a]pyrene— derived dG 
lesion is compatible with the WMSA mechanism, 
allowing for an essentially undisturbed pentacovalent 
phosphorane transition state, and explaining the 
bypass of this lesion with little mutation by Pol k. 

INTRODUCTION 

The Y family translesion synthesis (TLS) DNA polymer- 
ases are generally low fidelity on undamaged DNA (1) and 



have functions which include, among others, to bypass 
distorting bulky DNA lesions (2-7) that primarily stall 
high fidelity replicative polymerases; one or more 
members of the Y family TLS polymerases may then 
replace the high fidelity polymerase to bypass the region 
of distortion (7-9). This bypass may be error-free or 
error-prone, depending on the nature of the lesion and 
the specific polymerase (1-3,7). 

Human Pol k, a member of the Y-family polymerases, 
has the striking capacity for near error-free bypass in vitro 
(10-12) and in mammalian cells (13), of bulky lesions linked 
to the amino group of guanine that reside in the B-DNA 
minor groove. Particularly interesting is a series of studies 
of the major benzo[a]pyrene (B[a]P)-derived adduct, 
\QS(+)-trans-anti-B[a]P-N 2 -dG (B[a]P-dG) (Figure 1). 
Benzo[a]pyrene is an ubiquitous environmental carcinogen, 
present in tobacco smoke, automobile exhaust and cooked 
foods (14-16), and the B[a]P-dG adduct is highly mutagenic 
(17,18). The studies showed mostly error-free by- 
pass of this carcinogen-induced lesion by Pol k in vitro 
(11,19-22). Furthermore, in mouse embryonic fibroblasts 
it has been shown that Pol k bypasses this adduct with 
higher efficiency and accuracy than other Y family poly- 
merases (23), and mutagenic bypass depends on other 
polymerases in the system (24). In addition, it was shown 
that mouse embryonic stem cells are protected against 
mutations targeted to guanine by the diol-epoxide 
metabolite (B[cr]PDE) of B[a]P (25). NMR solution 
studies have shown that the B[a]P-dG adduct resides 
in the B-DNA minor groove, directed 5' along the 
modified strand (17,26-29). While minor groove lesions 
are bypassed near error-free by Pol k, lesions residing on 
the major groove side tend to cause the polymerase to stall 
(11,30,31). 

Structurally, high fidelity replicative and the Y family 
bypass polymerases, including Pol k, share common 
features: they are shaped like a hand with palm, fingers 
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10S (+)-trans-anti-B[a]P-N 2 -dG 

Figure 1. Structure of the 105 (+)-trans-anti-B[a]P-N 2 -dG (B[a]P-dG) 
adduct. The absolute configurations of the four chiral atoms C7, C8, C9 
and CIO are indicated. Torsion angles are defined as x = 04'(dR)-Cl'(dR) 
-N9-C4(dRisdeoxyribose),oe' = Nl -C2-Af 2 -C10(B[fl]P-dG), (3' = C2 
- N 2 - C10(B[a]P-dG) - C9(B[a]P-dG). 




Figure 2. Active site of Pol k ternary complex ((32) and PDB ID: 
20H2) remodeled with MD as described in Computational Methods 
section. This structure was utilized to initiate the QM/MM-MD simu- 
lations. The octahedral coordination of the two Mg +2 ions is shown 
with the dashed lines and their distances are shown in Supplementary 
Figure S2. Hydrogen atoms are not displayed for clarity except for the 
H(03') and the crystal water hydrogens. 



and thumb domains; however, the bypass polymerases all 
have an additional domain known as the little finger or 
PAD (Polymerase Associated Domain) (2,6). Another 
notable difference is the absence of an induced fit mech- 
anism upon the entry of a dNTP. In high fidelity polymer- 
ases the fingers close so that the nascent base pair is in van 
der Waals contact with the polymerase prior to reaction. 
In Y-family polymerases the active site is preformed, 
without the opening/closing induced fit motion in going 
from the binary to the ternary complex; the active site is 
more open and solvent exposed, particularly on the minor 
groove side of the nascent duplex (1,2,4). These structural 
differences account for the low fidelity of the bypass poly- 
merases on undamaged DNA and also their functionality 
in lesion bypass, in which only small numbers of nucleo- 
tides are synthesized to transit the region of lesion-induced 
distortion (1,2,7). DNA polymerase k is unique in having 
an additional domain, known as the N-Clasp (an N- 
terminal extension of ~75 residues) on the major groove 
side of the nascent duplex; as a result a region of the DNA 
near the active site is completely encircled by Pol k, as 
shown in the crystal structure containing the ternary 
complex with template-primer and incoming dNTP; 
without the N-Clasp DNA synthesis by Pol k is severely 
impaired (32). 

Pol k has an active site for catalysis of the nucleotidyl 
transfer reaction that shares common features with all 
DNA polymerases. Overall, the 3'-OH group on the 
sugar of the primer terminus reacts with the a-phosphate 
of a deoxyribonucleoside triphosphate, pyrophosphate is 
released and the chemical step of nucleotide insertion is 
achieved. A pair of universally conserved divalent metal 
ions (normally Mg +2 ) has been suggested to play key cata- 
lytic roles in this essential process: the catalytic ion, Mg +2 
A (Figure 2), lowers the pKa of the 3'-OH and facilitates 
the nucleophilic attack; the nucleotide-binding ion, Mg +2 
B, assists the leaving of the pyrophosphate (33). Both ions 
organize the active site in octahedral coordination with a 
triad of conserved carboxylate-containing amino acid 



residues, water, the primer 03' and phosphate oxygens 
of the dNTP (33). A high resolution view of a 
well-organized active site in Pol (3 was obtained by Batra 
and coworkers (Supplementary Figure SI) (34); this struc- 
ture contained the 3'OH group of the sugar and utilized a 
non-hydrolyzable deoxynucleotide analog to inhibit the 
extension reaction, instead of the dideoxy termini that 
had been commonly employed and did not provide 
organized active sites. 

A Water Mediated and Substrate Assisted mechanism 
(WMSA) has been previously delineated for the 
low-fidelity lesion-bypass polymerase Dpo4 (Y-family) 
and the high-fidelity replicative DNA polymerase T7 
(A-Family) (35,36). The mechanistic details revealed a 
stepwise mechanism with essential common features: the 
general base for the H(03') is the y-phosphate of the 
dNTP; this protonation takes place first via proton 
transfer through a shuttle of two mediating water mol- 
ecules. The subsequent associative nucleophilic attack of 
the 03' on P a is accompanied by a transfer of the proton 
from the y-phosphate to the oc,P-bridging oxygen via a 
water molecule. This mechanism is consistent with the 
work of Castro, Cameron and coworkers (37) which 
revealed that there are two proton transfers in the 
overall chemical reaction in a number of polymerases, 
one of which was suggested to correspond to the 
deprotonation of the 3'-OH and the other to protonation 
of the pyrophosphate leaving group. 

The goal of the present work was to determine if a 
WMSA-type mechanism was feasible for human Pol k 
and to gain insights on the near error-free lesion bypass 
of the B[//]P-dG adduct by Pol k. We thoroughly explored 
the nucleotidyl transfer mechanism of undamaged Pol k 
and assessed whether the characterized mechanism 
obtained in our extensive searches remained feasible in 
the presence of the lesion. We used advanced ab initio 
quantum mechanics/molecular mechanics-molecular 
dynamics simulations (QM/MM-MD) with the umbrella 
sampling method (38,39) to calculate free energy profiles 
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for the mechanisms investigated. At each MD time step, 
the atomic forces as well as the total energy of the enzyme 
system are calculated with the pseudobond ab initio QM/ 
MM method (40,41), in which the bond forming/breaking 
process is treated by the ab initio QM method, while the 
rest of the system is described by the molecular mechanical 
force field; the pseudobond approach provides a robust 
treatment of the QM/MM boundary across covalent 
bonds. From a series of biased simulations (42), which 
are employed to enhance the sampling of lower probabil- 
ity states, the free energy profile along the reaction coord- 
inate is obtained with the weighted histogram analysis 
method (WHAM) (43^15). This state-of-the-art 
approach provides a first-principle QM description of 
the chemical reaction, adequately accounts for the biolo- 
gical environment, and takes account of the fluctuations 
of the reaction active site and the surrounding enzyme 
system. The method has been demonstrated to be 
powerful in determining the mechanism of various 
enzymes (38-43). 

Previous molecular dynamics (MD) simulations have 
shown that in the ternary complex prior to reaction the 
B[a]P-dG adduct, linked to a templating guanine, is well 
accommodated on the minor groove side in Pol k; a dCTP 
incoming nucleotide maintains normal Watson-Crick 
pairing with the damaged guanine, and the active site 
appears well-organized (30). By contrast, the analogous 
adenine adduct (105 (+)-trans-anti-B[a]P-N 6 -dA), directed 
towards the N-clasp on the major groove side of the 
evolving duplex, probably impedes translocation sterically, 
preventing the damaged template from entering the inser- 
tion site; this is consistent with experimentally observed 
blockage at the pre-insertion site (11,30). Our present 
results showed that the WMSA mechanism provides the 
lowest free energy path for the reaction among the 
multiple mechanisms we investigated for Pol k; further- 
more, the B[a]P ring system in the B[a]P-dG adduct does 
not disrupt the pentacovalent phosphorane transition state 
nor the water network required for the proton transfer 
steps that are essential for the mechanism. 

COMPUTATIONAL METHODS 

Scheme 1 summarizes our computational protocol. 

Structure preparation 

The ternary crystal structure (PDB ID (46): 20H2) of Pol 
K i9-526 whh DNA and incoming dTTP (32) was the basis 
for our prepared enzyme-substrate model. Details are 
given in Supplementary Data including Supplementary 
Scheme SI. 

Classical MD 

The initial model was subjected to 3 ns of MD simulations 
using AMBER 9 (47). We employed the Amber99SB 
(48-50) force field with modification for DNA by 
parmbscO (51), and previously calculated parameters for 
the dCTP (52). The structure was neutralized by 10 Na + 
counterions and was solvated with a periodic rectangular 
box of TIP3P water (53,54) with 10 A buffer around the 
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Scheme 1. 



enzyme-substrate complex. Details of the MD protocols 
are given in Supplementary Materials. A random snapshot 
from the last 0.5 ns of the 3 ns trajectory was utilized for 
the subsequent QM/MM-MD stage. In this snapshot the 
Mg +2 coordination, Mg +2 -Mg +2 distance, and 03'-P a 
distance were similar to those in the high resolution 
X-ray crystal structure of pol p (34), in our previous Pol 
k model (30), and in the initial models utilized for our 
previous polymerase mechanism studies with Dpo4 (35) 
and T7 (36) (shown in Figure 2 and Supplementary 
Figure S2). 

QM/MM and QM/MM-MD 

The QM subsystem (81 atoms) was comprised of the 
dCTP, the primer 3'-nucleotide (excluding the C5' phos- 
phate group), the glutamic acid (El 99), the catalytic and 
nucleotide-binding Mg +2 ions (Mg +2 A and Mg +2 B, re- 
spectively), and two water molecules (Supplementary 
Figure S3). While the QM subsystem did not include all 
the ligands of the two Mg + ions (Figure 2), the critical 
ones for the chemistry of the nucleotidyl transfer reaction 
are included: El 99, the crystal water molecule, the attack- 
ing 03' and the dNTP phosphate oxygens. The other 
ligands, which do not participate in the chemistry per se 
but act to maintain the active site structure in a 
reaction-ready organization, are simulated through the 
MM component of our computations. Hence, the overall 
treatment of the Mg +2 ion ligands is sufficiently robust. 
This subsystem was treated by the hybrid density func- 
tional B3LYP (55-57) with a medium split valence basis 
set and polarization functions 6-31G* (58,59). All other 
residues and the surrounding water molecules (11774 
atoms), constituting the MM subsystem, were described 
by the TIP3P water model and the modified Amber99SB 
force field that was used in the classical MD simulations. 
Spherical boundary conditions were applied in all QM/ 
MM calculations and all atoms outside the 20 A radius 
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centered at theprimer 3'-end oxygen atom were frozen. A 
cutoff of 12.0 A was used for van der Waals interactions, 
and a cutoff of 18.0 A was employed for electrostatic inter- 
actions among the MM atoms. There was no cutoff for 
electrostatic interactions between the QM and MM atoms. 
Solvent molecules outside of a 27 A sphere centered at the 
primer 3'-end oxygen atom were removed. The 
pseudobond approach (40,41) with improved parameters 
(60) was employed to treat the partition between QM and 
MM subsystems (Supplementary Figure S3). All QM/MM 
calculations were carried out utilizing the modified 
QChem (61) and Tinker (62) programs. The reactant 
was minimized by the efficient iterative optimization 
QM/MM approach. We mapped the minimum energy 
path for the investigated mechanisms utilizing the 
reaction coordinate driving method (42). For each 
determined structure along the path, a 500 ps MD simu- 
lation with the MM force field was carried out to equili- 
brate the MM sub-system. The resulting snapshot was 
used as the starting structure for ab initio QM/MM-MD 
simulations with umbrella sampling (38,39,63) that applies 
a harmonic potential to constrain the reaction coordinate 
(RC) at successive values. In order to ensure sufficient 
overlap between the successive windows, force constants 
in the range of 0 to 300 kcal/mol-A 2 were employed; these 
were chosen based on the estimated slope of the free 
energy curve at the reaction coordinate. The potential 
of mean force (PMF) was calculated from the probability 
distributions along a reaction coordinate using WHAM 
(43-45). 

Pol k containing the B|«|P-dG lesion in the pentacovalent 
phosphorane transition state 

For the studies involving the B[a]P-dG adduct we began 
with the pentacovalent phosphorane transition state struc- 
ture of the characterized WMSA mechanism. We modeled 
the B[a]P-dG lesion by docking the B[a]P moiety to the 
amino group of the templating guanine, placing the B[a]P 
on the minor groove side in Pol k, oriented in the 5'-direc- 
tion of the modified strand as in the NMR solution struc- 
ture (26). Because the transition state contains five covalent 
bonds about the a-phosphate, it was necessary to develop 
an Amber-compatible force field to simulate this transient 
chemical state for the unmodified Pol k. A new residue 
comprised of the incoming dCTP and the 3'-terminal nu- 
cleotide of the primer DNA strand was defined. This 
residue contained four new atom types: one for the 
protonated y-phosphate oxygen of the dCTP, a second 
for the y protonating hydrogen, a third for the a-phosphate 
of the dCTP and the fourth atom type was defined to 
describe both the dCTP a,B-bridging oxygen and the 
primer 03'. Topology assignments, geometries and partial 
charge parameters were characterized on the basis of the 
QM/MM-MD calculations for the transition state (data 
are listed in Supplementary Tables SI and S2). The force 
constants were chosen to be large enough to maintain the 
transition state bond lengths, bond angles and dihedral 
angles in the unmodified Pol k. Partial charges were taken 
from the QM calculations. The force field parameters for 
the guanine lesion, B[a]P-dG, were taken from Jia and 



coworkers (30). The MD protocol was the same as 
described in Supplementary Data. 

All initial model constructions were carried out using 
the INSIGHT II 2005 program (Accelrys Software, Inc.). 
PyMOL (Delano Scientific, LLC) (64) was employed to 
make molecular images and movies. 

RESULTS 

We investigated many reaction schemes which considered 
the destination and pathways open to the H(03') as it is 
taken up by a base, the roles of amino acid residues in the 
active site in the mechanism, and the order of events in the 
nucleotidyl transfer reaction. The minimum energy path 
was calculated for each scheme employing the reaction 
coordinate driving method and B3LYP(6-31G*) QM/ 
MM calculations. For several mechanisms two- 
dimensional minimum potential energy surfaces were 
mapped in order to locate an optimal path with the 
lowest energy barriers and stable intermediates (Scheme 
1 and Supplementary Figures S4B, S5B and S6E). The 
most promising schemes were further investigated with 
B3LYP(6-31G*) QM/MM-MD simulations. This hier- 
archy of strategies allowed us to examine numerous mech- 
anistic paths (Scheme 1). The lowest energy pathway that 
we obtained was a variant of the WMSA mechanism, 
while those that involved an amino acid as the base and/ 
or utilized an amino acid to protonate the dCTP 
y-phosphate were unfavorable due to uphill paths or 
unstable intermediates. 

In the WMSA mechanism the y-phosphate of the 
incoming dNTP serves as the base for the H(03') which 
shuttles through water molecules 

We found that a variant of the WMSA mechanism, which 
was previously proposed for the bypass DNA polymerase 
Dpo4 (35) and the high-fidelity replicative T7 DNA 
polymerase (36), is preferred for the human bypass 
DNA polymerase k over the many explored schemes. 
Two steps are involved in this mechanism. In the first 
step, the H of 03' is transferred to the oxygen on the 
y-phosphate of the dCTP via two mediating water mol- 
ecules (Figures 3 and 4). Hydrogen bond analysis revealed 
stable interactions throughout the reaction pathway 
between the (3-phosphate group of the dCTP with 
Thrl38, Argl44 and Phelll, while the y-phosphate 
group hydrogen bonds with Argl44 and Lys328 and 
with the Ala 110 backbone (Figure 5 and Supplementary 
Figure S7). The y-phosphate has the most negative partial 
charge among the phosphate groups of dCTP (52), and 
this is relieved by protonation (Figure 6). In addition, the 
a-phosphate maintained stable hydrogen bonds with the 
two water molecules and helped anchor the second water 
(not coordinated with Mg +2 ) throughout the reaction. The 
proton transfers in this shuttle are: from the 03' to the 
crystal water molecule that participates in the Mg +2 co- 
ordination; from this water to a solvent water, that is 
stably held by hydrogen bonds with oxygen atoms of the 
substrate, and from the second water to the y-phosphate 
of the dCTP (Figures 3 A and 4). The crystal water found 
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Figure 3. WMSA mechanism: (A, B) Reaction schemes and (C) Free energy profile. R: reactant; TS1: transition state 1; I: intermediate; TS2: 
transition states 2; P: product. These are illustrated in Figure 4. 



in Pol k (32) is also observed in a number of other poly- 
merases (34,65-70). In the organized active site of Pol p 
(34) and in several other DNA polymerases (66,68-70) a 
second crystal water is located at the same position as the 
solvent water (WAT2) in our model (Supplementary 
Figure SI). A free energy profile was calculated for the 
first step of this mechanism by employing 37 umbrella 
windows along the reaction coordinate, each simulated 
for 30 ps of B3LYP(6-31G*) QM/MM-MD. An activa- 
tion barrier of 11.2 ± 0.4 kcal/mol was determined for 
this free energy profile (Figure 3C). (Supplementary 
Movie SI). 

In an associative mechanism a pentacovalent phosphorane 
is the transition state of the nucleotidyl transfer reaction 

With a protonated y-phosphate as a stable intermediate 
the reaction proceeds to the second step with a nucleo- 
philic attack of the naked 03' on the a-phosphate and 
leaving of the pyrophosphate (Figures 3B and 4). The 
transition state is a pentacovalent phosphorane structure, 
which results from an associative nucleotidyl transfer 
reaction; that is, the 03-Pq, bond forms before the 
Pa-O ai p_bridge breaks (Supplementary Figure S8). The free 
energy barrier obtained for this step is 10.6 ± 0.3 kcal/mol, 
calculated using 50 umbrella sampling windows 
along the reaction coordinate each for 35 ps of 
B3LYP(6-31G*) QM/MM-MD (Figure 3C) (Supple- 
mentary Movie S2). As the reaction progresses the 
distance between the catalytic Mg +2 A and the nucleo- 
tide-binding Mg +2 B becomes shorter and achieves a 
minimum of 3.59 ± 0.09 A at the transition state 
(Supplementary Figure S2A) which promotes the 



formation of the 03'-P a bond; it subsequently elongates 
again to facilitate the breaking of the P a -O ai p_b r idge bond. 
By approaching metal B, metal A brings the nucleophile 
03' within attack distance of the P a , as was suggested by 
Yang et al. (71). 

Stabilization of the pentacovalent phosphorane transition 
state involves interactions with nearby amino acid residues 

We wished to elucidate the roles of the amino acid residues 
in the active site in facilitating the WMSA mechanism. 
We analyzed hydrogen bonding interactions, and 
calculated intermolecular interactions (including van der 
Waals and electrostatic energies). We also analyzed group 
partial charges of each of the three phosphates in the 
dCTP along the reaction pathway for the nucleophilic 
attack. We compared the transition state values with 
those for the intermediate in which the y-phosphate was 
protonated but phosphoryl transfer had not been 
initiated, in order to reveal the specific interactions that 
stabilize the transition state. 

Key hydrogen bonding interactions present in the transi- 
tion state are shown in Figure 5A. The partial charge 
analysis showed that negative charge builds up on the a 
and also to some extent on the P phosphates as the 
pentacovalent phosphorane forms during the nucleophilic 
attack (Figure 6). This motivates the proton on the 
y-phosphate to migrate toward the oc,P-bridge. Conse- 
quently the y-phosphate group becomes highly negatively 
charged (Figure 6). This negative charge provokes stronger 
electrostatic interactions between Argl44, Tyrl41 and 
Lys328 and the y-phosphate group compared to their 
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Figure 4. Key structures of the nucleotidyl transfer reaction catalyzed 
by Pol k: reaction mechanism (left) and critical structures (right). See 
Supplementary Movies SI and S2 for mechanism details. 



interactions in the y-protonated intermediate (Figure 7, 
Supplementary Figures S9 and S10). In addition there is a 
water mediated hydrogen bond involving Tyrl41 with the 
y-phosphate throughout both steps of the reaction. In the 
transition state the hydrogen bond is direct without water 
mediation (Supplementary Figure Sll). 

Transition state alignment is facilitated by C3'-endo 
sugar pucker 

The conformation of the sugar pucker of the primer 
terminus through the progression of the reaction is 
shown in Supplementary Figure S12. In the original 
crystal structure, which had a hydrogen in place of the 
3'0H, the sugar pucker was C2'-endo, but upon remodel- 
ing and MD simulation (see 'Computational Methods' 
section) the pucker changed to C3'-endo. Supplementary 
Figure S12 shows the pseudorotation parameter P (72,73) 



during the QM/MM-MD as a function of the reaction 
coordinate. We see that the pucker remains near 
C3'-endo until the neighborhood of the intermediate 
(Figure 4), where it oscillates around Cl'-exo but returns 
to C3'-endo in the transition state, providing the proper 
alignment for the transition state geometry. Crystal struc- 
tures of pre-reactant polymerase ternary complexes have 
been predominantly C3'-endo (34,66,68-70,74-76) except 
for Y-Family polymerases (2). 

Spontaneous proton transfer from the y-phosphate to 
the a,P-bridging oxygen occurs during leaving of the 
pyrophosphate 

Under the influence of the QM/MM-MD the dynamics 
revealed spontaneous proton transfer from the 
y-phosphate to the a, (3 phosphate bridge during the 
leaving of the pyrophosphate (Figure 4). This protonation 
accompanied the nucleotidyl transfer reaction step after 
formation of the pentacovalent phosphorane transition 
state and before the pyrophosphate left completely. The 
leaving of the pyrophosphate is assisted by this proton 
transfer, which relieves the accumulating negative charge 
on the a,p-bridging oxygen during the elongation of the 
P a -O a ,p-brid g e bond (Figure 6). 

Other mechanisms provide unstable intermediates or 
higher energy barriers 

Other pathways that we explored for the deprotonation of 
the H(03') included: ( i) Amino acids as the general base 
for H(03' ). There are two carboxylate-containing amino 
acids within a short distance of the primer (El 99 and 
D198). Both are part of the conserved amino acid triad 
in polymerase active sites (4). We considered the closer 
one, El 99 as a natural candidate for the base that 
absorbs the 03' proton. While the potential energy for 
this transfer revealed a promising activation barrier of 
30.1 kcal/mol, the E199 protonated intermediate was not 
stable and the hydrogen quickly returned to the 03' upon 
an unrestrained QM/MM-MD of 0.5ps (Supplementary 
Figure SI 3). This contrasts with the case for the WMSA 
mechanism discussed above where the 03' proton 
remained stably on the y-phosphate during 35 ps of 
unrestrained QM/MM-MD. We next tried a one-step 
mechanism in which the 03' donates its proton to El 99 
while it simultaneously approaches the a-phosphate 
for the nucleophilic attack. This mechanism always 
produced an uphill reaction energy profile 
(Supplementary Figure SI 4). 

( ii) Amino Acids as donor to protonate the dNTP. 
Experimental evidence indicates that the incoming dNTP 
must become protonated to ease the phosphoryl transfer 
and the leaving of the pyrophosphate, and it has been 
suggested that an amino acid could be the donor in the 
case of certain other polymerases (77). In Pol k a suitable 
donor amino acid could be Argl44 because a guanidino 
group proton is 2.8 A from the y phosphate oxygen 
(Figure 5 and Supplementary Figure S7). Therefore, we 
explored a path that involves protonation of the 
y-phosphate by this Argl44 (Supplementary Figure S15A). 
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Figure 5. Key stabilizing hydrogen-bonding interactions (dashed lines) between active site amino acid residues and the pentacovalent phosphorane 
transition state. (A) Unmodified system simulated with QM/MM-MD (B) Modified system simulated with classical MD. The unmodified system 
simulated with classical MD preserves the same hydrogen bonding interactions as shown in Supplementary Table S3. 
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State 1 State 2 

Figure 6. The a, p and y phosphate group charges during key stationary points of the reaction (Figure 4). The partial charge of the bridging oxygen 
atoms is considered to be evenly shared between the neighboring phosphate groups. 
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Figure 7. Stabilizing interactions (electrostatic and van der Waals) between Pol k amino acid residues and the QM active site region (defined in 
Supplementary Figure S3) for the attack-ready 03' intermediate and the pentacovalent phosphorane transition state (Figure 4). See also 
Supplementary Figures S9 and S10. 
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Another path we explored for the y-phosphate proton- 
ation combined proton donation by Argl44 with El 99 
serving as the base for the H(03') (Supplementary 
Figure S15B). In addition we considered the possibility 
that each of these two paths occurred simultaneously 
with the nucleophilic attack. All produced uphill 
reaction energy profiles (Supplementary Figure S15C). 

( in) The a-phosphate of the dNTP as the base or an inter- 
mediate in the path of H( 03' ) to the y-phosphate. 
Mechanisms involving protonation of the a-phosphate 
were examined: either a direct protonation or protonation 
via a water were considered (Supplementary Figure S6). 
The protonated a-phosphate did not provide a stable 
intermediate as found for the Dpo4 WMSA mechanism 
(35); upon a short unrestrained QM/MM-MD the 
hydrogen went back to the primer 03' to produce the 
reactant structure. By contrast the protonated 
y-phosphate intermediate in the WMSA mechanism was 
stable during 35 ps of unrestrained QM/MM-MD 
simulation. 

Disfavored mechanisms remain so irrespective 
of H(03) orientation 

We made an extensive investigation of the H(03') orien- 
tation and its effect on the reaction mechanisms that we 
studied. We carried out a survey of the free energy profile 
of the C2'-C3'-03'-H torsion angle which governs the 
orientation of the hydrogen of the 03'. This was carried 
out with AMBER 9 (47) by restraining this torsion angle 
at 10° intervals from 0° to 360° with 2 ns MD simulations 
for each window, utilizing a force constant of 30kcal/ 
mol-rad 2 . The WHAM method was then utilized to 
obtain the free energy profile (78). Figure 8 shows 
minima at 45°, 177° and 309° with relative energies of 
2.6, 7.8 and Okcal/mol, respectively. Our results showed 
that for the WMSA mechanism it was necessary for the 
H(03') to be near the 60° energy well, which allows the 
proton to shuttle through the water molecule, as in our 
earlier work (35,36); however, the disfavored mechanisms 
remained disfavored irrespective of the H(03') orientation 
(e.g. Supplementary Figure S14). 
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Torsion Angle (Degrees) 

Figure 8. Free energy profile of the torsion angle C2'-C3'-03'-H 
determining the orientation of the H(03')- At the 45° minimum the 
hydrogen is oriented towards the a-phosphate while at the 309° 
minimum it is directed toward E199. 



The pentacovalent phosphorane transition state structure 
is preserved in the presence of the B|fl]P-dG adduct 

We wished to evaluate whether the pentacovalent 
phosphorane transition state in our characterized 
WMSA mechanism is preserved in the presence of the 
B[a]P-dG adduct, modeled on the minor groove side of 
the templating base in Pol k, as in the NMR solution 
structure (26). For this purpose, we developed a new 
Amber-compatible force field for the transition state struc- 
ture in the unmodified Pol k (see 'Computational 
Methods' section and Supplementary Tables SI and S2). 
Transition states are extremely transient, with a lifetime of 
the order of 0.01 to 0.1 ps, in the same time frame as a 
single bond vibration (79). We carried out an MD simu- 
lation of the transition state structure in the presence of 
the B[a]P-dG adduct, and for an unmodified control tran- 
sition state structure. We wished to determine if the geo- 
metric properties of the QM/MM-MD determined 
transition state for the unmodified Pol k are preserved 
when the lesion is present. Supplementary Figure SI 6 
and Supplementary Table S3 shows that the characteristic 
phosphorane transition state geometry, after 1 ns of clas- 
sical MD, is preserved as in the QM/MM-MD transition 
state for both unmodified and modified systems. Figure 5 
shows the QM/MM-MD transition state and that of the 
B[c/]P-containing structure simulated by MD. These data 
reveal that the transition state is maintained in the 
presence of the lesion as well as it is in the unmodified 
case, and as found in the QM/MM-MD computations 
for the unmodified system. Figures 9 and 10, and 
Supplementary Movie S3 show the adduct structures 
prior to reaction and in the transition state after the 1 ns 
MD, revealing that they are very similar. Supplementary 
Figure S17 shows the linkage site torsion angles x, a' and 
P'; 242 ± 12°, 204 ± 39° and 58 ± 9°, respectively (as 
defined in Figure 1) that govern the carcinogen orientation 
on the minor groove side. They are very close to those 
obtained previously by Jia and coworkers, who performed 
MD simulations (30) for this adduct positioned in Pol k 
on the templating guanine opposite dCTP prior to the 
nucleotidyl transfer reaction (x, a' and P'; 225 ± 13.0°, 
187 ±8.1° and 64.4 ± 9.4°, respectively). These results 
indicate that the lesion does not interfere with the forma- 
tion of the transition state as the reaction progresses from 
its initial state. 



DISCUSSION 

We have carried out a very comprehensive investigation of 
the mechanism for the nucleotidyl transfer reaction in 
human DNA Pol k utilizing cutting edge ab initio QM/ 
MM-MD methods. Examined mechanisms considered the 
identity of the general base for the H(03') and different 
possibilities for the protonation of the dCTP. The roles of 
the amino acids in the vicinity of the dCTP in these 
investigated mechanisms were extensively explored, as 
were the functions of crystal and solvent waters. Our 
previous work utilizing QM/MM methods determined 
that a WMSA mechanism had the lowest free energy 
reaction path for the Y-family polymerase Dpo4 (35) 
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Figure 9. Pol k containing the B[«]P-dG adduct in the minor groove at the pentacovalent phosphorane transition state. View is towards the major 
groove and emphasizes the maintained Watson-Crick base pairing between B[«]P-dG and the incoming dCTP. (A) Zoom-out view (B) Close-up view. 
Color scheme: N-clasp domain, blue; B[a]P ring system, red; lesion-containing guanine, cyan; Primer-linked dCTP, pink. Watson-Crick hydrogen 
bonds between B[«]P-dG and dCTP, black dashed lines. The pentacovalent phosphorane is indicated by an arrow. See Supplementary Movie S3. 



and the A-family polymerase T7 (36). Here we used the 
more advanced QM/MM-MD method which has the ad- 
vantage of taking account of dynamics of the enzyme 
active site and its environment on an equal footing. 

We find that a variant of the WMSA mechanism again 
provides the lowest free energy path and that all other 
mechanisms investigated were not energetically feasible. 
The mechanism proceeds in two steps: (i) deprotonation 
of the H(03') with nearly concerted shuttling of the 
proton through a crystal and a solvent water to the 
y-phosphate of dCTP; the latter acts as the general base 
and the barrier is 11.2 ± 0.4 kcal/mol (Supplementary 
Movie SI); (ii) nucleophilic attack of 03' on the 
a-phosphate of the dCTP in an associative manner, to 
produce a pentacovalent phosphorane transition state; 
this is followed by pyrophosphate leaving accompanied 
by proton transfer from the y-phosphate to the 
a,P-oxygen bridge. The barrier for the second step is 
10.6 ± 0.3 kcal/mol (Supplementary Movie S2). These 
barriers may be compared with the experimentally 
observed k cat values in the range of 0.46 to 0.28 s -1 
(10,80-82); this k cat range corresponds to a barrier upper 
bound for the reaction of 18.7 to 19.0 kcal/mol according 
to transition state theory: k cat (T) = k B T/h exp[— AG ++ (T)j 
RT] (where k cat is the catalytic rate constant, K B is the 
Boltzmann constant, h is the Planck constant, R is the 
universal gas constant, T is the temperature (31 OK) and 
AG ++ is the free energy of activation). The mechanism is 
essentially the same as the WMSA mechanism determined 
for both T7 and Dpo4, with minor differences in paths 
taken by the migrating protons. However, the barriers 
for steps (i) and (ii), respectively were 13.0 and 10.5 kcal/ 
mol for Dpo4 and 14.2 and 5.5 for T7, indicating that 
deprotonation of the H(03') was rate limiting in those 
two cases. In Pol k the barriers for the two steps are 
very close in energy, although step (ii) appears to be 



slightly lower energy. Our current results point to the 
likelihood that the WMSA mechanism could be widely 
applicable, but that the details can differ in different poly- 
merases as governed by the specific local structural details. 

Overall, the WMSA mechanism is consistent with the 
experimental findings that the chemical reaction in several 
polymerases entails transfer of two protons, involving 
3'OH deprotonation and protonation of the leaving pyro- 
phosphate (37,77). However, for Pol k our extensive in- 
vestigations did not locate a low energy pathway that 
utilized a basic amino acid residue in the active site as a 
general acid to donate a proton to pyrophosphate, as ex- 
perimentally suggested for a number of other polymerases 
(77). In Pol k candidate amino acid residues might be 
Argl44 and Lys328, but they are positioned differently 
than in the experimentally investigated polymerases 
(Figure 5). Nonetheless, we find that the local amino 
acids, Argl44, Lys328 and Thrl38 play a key role in an- 
choring the dCTP throughout the reaction and provide 
added stabilization to the transition state by a network 
of hydrogen bond interactions. Particularly important is 
the role of Argl44 whose multiple hydrogen bonding 
opportunities are dynamically utilized as the reaction 
progresses (Figure 5). Another important stabilizing 
hydrogen bonding interaction is provided by Tyrl41; 
this amino acid is hydrogen bonded with the y-phosphate 
via a water throughout the reaction, but is hydrogen 
bonded directly at the transition state. Accordingly one 
could propose that mutating Argl44, Lys328, Thrl38 
and Tyrl41, would impact the catalytic power of the 
enzyme, with Argl44 perhaps having the greatest effect. 
These proposed mutations offer suggestions for future ex- 
perimental studies. However, none of them would be 
required to directly act as a general acid. It is entirely 
plausible, since local organization of amino acid residues 
is polymerase dependent (77), and that these residues can 
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Figure 10. Pol k containing the B[a]P-dG adduct viewed into the 
minor groove in close-up (A) Prior to reaction with the 03'H 
designated by an orange circle (B) At the transition state with 
pentacovalent phosphorane designated by an orange circle; Color 
scheme: N-clasp domain, blue; B[a]P ring system, red; lesion-containing 
guanine, cyan; dCTP and the primer are colored by atom. See 
Supplementary Movie S3. 



function in different ways to facilitate the nucleotidyl 
transfer reaction in different polymerases. 

The nature of the general base for the deprotonation of 
the H(03') has been considered in previous calculations 
for other polymerases and attention has centered mainly 
on amino acids as the general base (83-86). However, a 
water mediated shuttle to the y-phosphate was confirmed 
for Dpo4 (87) as in our WMSA mechanism (35). Our 
previous studies did not support an amino acid as 
general base for Dpo4 or T7 DNA polymerase (35,36), 
and the current results do not support it for Pol k. 
However, polymerase dependence certainly remains 
possible and experimental insights could provide further 
elucidation. 

Of particular significance is our determination that the 
pentacovalent phosphorane transition state structure that 
we computed with QM/MM-MD methods for the un- 
modified DNA in Pol k remains intact in the presence of 
the B[fl]P-dG lesion, covalently linked to the templating 
guanine and oriented in the 5'-direction of the modified 
strand as in the NMR solution structure (26). The lesion 
has been shown to be bypassed near error-free in vitro by 
DNA Pol k (1 1), and Pol k bypasses this lesion with higher 
efficiency and accuracy than other Y family polymerases 
(23). Furthermore, we have shown previously using MD 
simulations of dCTP Watson-Crick paired with the 
templating B[a]P-dG in Pol k, that the pre-reactant state 
remains organized for nucleotidyl transfer (30). Our 
current work shows, importantly, that the transient 
pentacovalent phosphorane transition state, whose 



stability would be in the 0.01-0.1 ps time range (79), 
remains stable for over a nanosecond of MD simulation 
both with and without the presence of the B[a]P-dG lesion. 
These findings provide key determinations that this minor 
groove bulky lesion disturbs neither the pre-reactant nor 
the critical transition state on the reaction pathway. 
An experimental comparison is provided by steady state 
kinetic studies with this B[a]P-dG stereoisomeric lesion 
and incoming dCTP in Pol k. These studies (80) showed 
an increase in overall barrier to reaction from 18.7 to 
21.5 kcal/mol, i.e. 2.8kcal/mol, according to transition 
state theory. This modest difference does not appear to 
stem from impediments to the nucleotidyl transfer 
chemical reaction, but to other factors that are 
subsumed in the overall k cat that derive from the 
presence of the bulky polycyclic ring system, such as 
perhaps translocation (88). 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Tables 1-3, Supplementary Figures 1-17, 
Supplementary Movies 1-3, Supplementary Compu- 
tational Details, Supplementary Scheme 1 and Supple- 
mentary References [30,32,34,46,47,54,65,89-97]. 
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